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FOREWORD 


This  technical  report  describes  a  constant  false  alarm  rate  (CFAR)  detection  algorithm  along 
with  a  thi-eshold  selection  algorithm  derived  for  target  detection  in  sonar  imagery.  It  is  designed 
for  multi-frequency  sonars  and  combines  the  advantages  of  template-based  detection  with 
environmental  adaptation.  This  combination  avoids  the  numerical  estimates  of  a  K-distribution 
based  statistical  test,  while  exploiting  geometric  features  and  frequency  response  differences 
between  the  local  environment  and  man-made  targets.  The  detection  algorithm  is  based  on  our 
analysis  of  processed  data  and  derived  from  a  general  statistical  model.  The  utility  of  CFAR 
algorithms  is  that  the  selection  of  a  detection  threshold  can  be  made  independently  of  most 
image  variations.  However,  to  account  for  the  widest  variety  we  apply  a  local  adaptive  threshold 
selection  mechanism  and  demonstrate  its  adaptability. 
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INTRODUCTION 


Detection  of  targets  in  sonar  imagery  presents  many  challenges  not  generally  found  in  optical 
imagery.  Sonars  that  are  considered  high  resolution  still  have  resolution  values  much  coarser 
than  most  optical  images.  This  can  be  seen  by  simple  visual  inspection  of  known  object 
representations  in  the  data. 

When  a  sonar  pulse  is  transmitted  into  the  water,  some  of  the  sound  reflects  off  of  the  target. 
Additionally,  there  are  many  other  sources  where  the  sound  energy  may  reflect  back  towards  the 
receiver  creating  interference.  This  scattering  is  caused  by  the  many  sources  of  inhomogeneities 
in  the  ocean.  These  sources  may  include  fish,  other  biologies,  air  bubbles,  dissolved  elements, 
and  particulates,  as  well  as  the  ocean  bottom,  and  surface,  whereas  atmospheric  scattering  of 
light  is  negligible  in  all  but  the  most  extreme  cases. 

Many  detection  algorithms  for  sonar  imagery  have  been  developed  to  address  this  challenging 
problem.  Each  exploits  different  data  characteristics  in  an  attempt  to  find  anomalous  items  such 
as  mine-like  objects,  unexploded  ordnance,  or  other  man-made  debris.  The  approaches  range 
tfom  as  general  as  possible  to  very  specific. 

One  general  approach  to  detection  and  feature  extraction  that  has  recently  been  explored  for 
sonar  imagery  is  Canonical  Coordinate  Analysis  (CCA).  The  canonical  coordinate 
decomposition  method  determines  linear  dependence  or  coherence  between  two  or  more  data 
channels.'’  ^  Canonical  coordinate  decomposition  allows  us  to  quantify  the  changes  between  the 
returns  tfom  the  bottom  and  when  target  activities  are  present  and  at  the  same  time  extract  useful 
features  for  target  classification  without  the  need  to  perform  separate  detection  and  anomaly 
feature  extraction.  This  method  is  not  based  on  geometry  to  allow  target  to  vary  greatly  in  shape 
and  orientation. 

At  the  other  end  of  the  spectrum  there  are  model-based  methods.  For  example,  an  Automatic 
Target  Recognition  (ATR)  algorithm  using  Scale-invariant  feature  transform  (SIFT)  features.^ 
This  method  physically  models  the  3D  underwater  scene.  It  generates  simulated  template  images 
for  both  the  target  and  environment.  Another  paper  uses  a  similar  starting  point."'  It  also  relies 
on  a  model-based  approach.  Each  one  uses  a  different  model  to  reproduce  three  elements  which 
are  key  to  sonar  ATR,  highlight  structure,  seabed  reverberation,  and  shadow  zones.  Both  model 
both  the  targets  and  the  backgrounds. 

Several  methods  have  also  used  the  fusion  of  multiple  Ifequency  bands  to  improve  detector 
performance.  An  adaptive  clutter  filter  detector  has  been  individually  applied  to  three  different 
sonar  images  varying  in  Ifequency  and  bandwidth.^’*’  Additional  views  of  an  object  can  be 
considered  in  a  similar  fashion.  Rather  than  correlating  the  image  data,  a  classification  algorithm 
has  been  employed  to  transform  the  multiple-aspect  classification  problem  into  a  multiple- 
instance  learning  problem.’ 

The  method  proposed  in  this  paper  is  along  the  continuum  between  target  agnostic  and  fully 
modeled  and  uses  multiple  Ifequency  bands  when  they  are  available.  The  use  of  all  of  these 
sophisticated  approaches  is  justified  since  the  modeling  of  sonar  data  leads  to  inexact  classical 
detection  statistics.  Initially,  sonar  data  was  widely  modeled  as  Rayleigh  distributed,  since  the 


I 


detected  envelope  of  the  sonar  return  at  any  given  time  is  assumed  to  be  the  square  root  of  the 
sum  of  the  energy  from  many  independent  complex  Gaussian-distributed  reflectors.*^  However, 
it  has  been  shown  that  in  cluttered  environments,  the  assumptions  of  the  Rayleigh  distribution 
approximation  are  inaccurate.'*  The  number  of  reflectors  in  a  local  area  may  be  reduced  due  to 
occlusion  and  the  viewing  angle  of  the  sensor,  at  increasing  ranges,  may  be  highly  correlated  in 
phase  due  to  natural  textures,  resulting  in  a  probability  model  for  pixel  intensity  known  as  the  K- 
distribution. 

The  K-distribution  is  a  compound  model  which  consists  of  Gaussian  speckle  modulated  by  a 
slowly  varying  mean  level,  this  local  mean  being  gamma  distributed.  There  is  no  closed  form 
solution  for  the  probability  of  detection  in  K-distributed  clutter,  so  numerical  methods  are  the 
only  current  method  of  determining  a  detection  threshold  value.'” 

Since  closed  form  statistical  tests  are  not  available  for  pixel  level  decisions,  we  introduce  a 
statistically  valid  application  of  the  Mahalanobis  distance  to  higher  level  processing.  The  result 
is  an  algorithm  that  allows  for  the  a-priori  selection  of  a  threshold  for  desired  performance,  also 
known  as  Constant  False  Alarm  Rate  (CFAR)  algorithm.  The  algorithm  exploits  the  covariance 
between  frequency  bands  and  can  be  applied  to  any  number  of  frequency  bands,  including  a 
single  band.  The  data  presented  in  this  paper  comes  from  a  sonar  system  with  two  such  bands. 

The  detection  algorithm  and  threshold  selection  method  described  in  this  paper  has  been 
presented  for  multispectral  optical  imagery  in  reference."  The  multispectral  application  did  not 
use  the  selectivity  that  comes  with  a  well-defined  target  model.  But,  it  did  show  that  the 
detection  output  could  vary  significantly  for  identical  targets  on  different  backgrounds,  thus 
necessitating  an  adaptive  threshold  method.  This  paper  adds  matched  filter  processing 
(increased  selectivity)  to  the  imagery  to  arrive  at  data  which  satisfies  the  statistical  hypothesis 
used  in  the  optical  application. 

The  algorithm  outline  is  presented  in  the  Algorithm  Outline  section.  The  supporting  reasoning  is 
provided  in  the  subsequent  sections  in  the  order  implemented  in  the  algorithm.  The  Matched 
Filter  section  describes  the  basis  of  the  test  statistic  developed  in  the  Statistical  Assessment 
section.  The  distribution  of  the  test  statistic  depends  only  on  the  number  of  frequency  bands  and 
the  number  of  pixels  in  each  sample.  Since  the  samples  may  be  intermixed  and  the  distribution 
skewed,  the  test  values  are  fit  to  a  flexible  model,  which  is  presented  in  the  Threshold  Selection 
section.  Fitting  the  actual  results  of  the  detection  algorithm  with  a  model  of  a  portion  of  the 
theoretical  output  enables  background-independent  threshold  selection.  A  desired  false  alarm 
rate  is  input  into  the  threshold  algorithm  and  the  detections  are  culled  accordingly.  Finally,  the 
Results  section  contains  the  results  of  the  algorithm  showing  the  reliability  of  the  threshold 
algorithm  over  several  background  types. 

ALGORITHM  OUTLINE 

Our  goal  is  to  develop  an  algorithm  with  an  automatic  threshold  selection  mechanism.  The 
statistical  distribution  of  pixel  values  in  a  sonar  image  is  generally  modeled  as  a  K-distribution. 
As  we  noted  in  the  introduction,  developing  a  statistical  test  based  on  this  distribution  is 
problematic.  Further,  such  a  test  would  be  valid  on  a  pixel  level,  but  would  ignore  the  additional 


2 


information  inherent  in  the  particular  object  recognition  problem  that  defines  our  goal.  There  are 
three  steps  in  the  algorithm. 

Several  of  these  basic  target  characteristics,  such  as  general  size  of  the  objects  and  the  highlight- 
shadow  relationship,  are  captured  in  a  template  or  matched  filter  type  of  detector.  This  is  the 
first  step.  Details  on  the  matched  filter  application  are  given  in  the  Matched  Filter  section. 

The  result  of  the  matched  filter  output  can  then  be  viewed  as  the  starting  point  of  a  more  general 
statistical  analysis.  Step  2  is  the  statistical  test  based  on  the  distribution  of  the  matched  filter 
correlations.  From  the  matched  filter  output  we  evaluate  target  sized  areas  and  surrounding 
background  correlation  values.  Sample  means  and  variances  are  calculated  for  these  areas  using 
convolution  masks. 

In  an  ideal  case,  a  threshold  can  be  chosen  based  on  the  number  of  Ifequencies  and  the  size  of  the 
samples  in  the  correlation  analysis.  We  generally  found  consistency  in  form,  but  shifts  in  actual 
values.  To  accommodate  these  shifts,  we  apply  a  simple  model  to  the  test  statistic  data  in  step  3. 
This  gives  us  a  dynamic  threshold  that  automatically  updates  based  on  data  characteristics.  The 
model  and  supporting  reasoning  is  explained  in  the  Statistical  Assessment  section.  The  chart  in 
Figure  1  shows  the  algorithm  outline. 


Frequency  1 


Frequency  2 


Frequency  N 


Figure  1.  Algorithm  Outline 


MATCHED  FILTER 


The  basis  of  the  matched  filter  detector  was  developed  by  Dobeck.'^  It  consists  of  an  image 
normalization  stage  followed  by  the  matched  filter  step.  The  purpose  of  the  normalization  is  to 
pre-condition  the  image  so  that  the  subsequent  step  is  robust  to  variations  in  background  level.  It 
reduces  variability  due  to  sonar  energy  variation,  motion  compensation  success,  and  other  sonar 
artifacts.  This  is  accomplished  by  normalizing  the  backgrounds  throughout  the  image  to  a 
constant  level  so  that  highlight  and  shadow  levels  are  consistent. 


The  normalizer  uses  a  forward  and  backward  filter  to  estimate  the  local  background  level. The 
Serpentine  Forward  filter  begins  at  the  top  of  the  image  and  selects  a  path  along  which  the  pixel 
values  within  the  filter  domain  best  follow  the  original  image  values.  The  path  is  generated 
recursively  by  extending  the  path's  latest  end  point  to  one  that  belongs  to  an  admissible  subset  of 
its  neighbors  whose  intensity  is  most  near  its  filtered  output  value.  As  the  Serpentine  Forward 
filter  progresses,  it  is  permitted  to  snake  to  the  right  or  left  in  order  to  follow  the  best  path.  The 
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Serpentine  Backward  filter  is  defined  analogous  to  the  Serpentine  Forward  filter,  but  the  image  is 
processed  Irom  bottom  to  the  top.  The  complete  algorithm  is  presented  in  reference  13. 

Following  normalization,  there  are  three  different  filters  that  are  each  applicable  to  specific 
regions  in  the  imagery.  In  particular,  the  shadow  length  in  the  high  Ifequency  imagery  increases 
as  a  function  of  range.  This  allows  the  filter  to  account  for  the  variability  of  the  background  and 
mine  signature  as  a  function  of  range.  One  example  of  a  matched  filter  with  a  short  shadow  is 
shown  in  Figure  2.  Highlights  are  represented  by  “H”  and  shadow  regions  are  represented  by 
“S.”  The  exact  values  depend  on  the  normalization  applied.  Our  normalization  is  designed  to 
give  highlight  values  close  to  one  and  shadow  values  close  to  zero. 


H 

s 

H 

H 

S 

s 

H 

S 

Figure  2.  Matched  Filter 

Each  matched  filter  mask  contains  three  distinct  regions;  highlight,  dead  zone,  and  shadow/post¬ 
target.  A  different  set  of  matched  filters  is  used  for  low  Ifequency  bands.  Because  shadow  is  not 
a  reliable  phenomenon  in  low  Ifequency  data  (due  to  diffraction  and  multi-path  effects),  the 
shadow  area  is  significantly  smaller.  The  matched  filters,  m(x,  y),  are  correlated  with  an  iV  x  M 
image,  /(x,y),  according  to 


N  M 

^  ^/(x,y)m(x  -i,y  -j).  (1) 

x=0 y=0 

The  distribution  of  the  matched  filter  output,  represents  both  positive  and  negative 

correlations.  Our  study  found  that  these  values  tend  to  have  a  Gaussian  distribution  for  each 
frequency  band  and  across  various  backgrounds.  These  observations  form  the  basis  of  the 
statistical  tests  developed  in  the  following  section. 

Figure  3  and  Figure  4  show  some  high  frequency  synthetic  aperture  sonar  data  that  were  part  of 
the  evaluation.  Figure  3  shows  a  nearly  uniform  seafloor.  Figure  4  is  more  complex,  with 
scattered  irregular  structures.  Figure  5  and  Figure  6  show  the  distributions  of  the  matched  filter 
outputs  for  high  and  low  frequencies  in  smooth  and  complex  backgrounds.  Figure  5  is  computed 
from  Figure  3  and  Figure  6  is  computed  from  Figure  4.  The  distribution  results  held  for  a  variety 
of  backgrounds  and  high  and  low  frequencies.  These  distributions  are  used  in  the  statistical  tests 
in  the  next  section. 
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Pixel  Value  Pixel  Value 

High  Frequency  Low  Frequency 

Figure  5.  Smooth  background  matched  filter  output 
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Pixel  Value  Pixel  Value 

High  Frequency  Low  Frequency 


Figure  6.  Complex  background  matched  filter  output 

The  statistical  assessment  of  the  following  section  is  based  on  all  the  values;  however,  the  values 
may  be  truncated  to  provide  only  positive  correlations.  It  is  appropriate  to  perform  this  operation 
at  this  step.  Highly  negative  correlations  are  statistical  anomalies,  but  do  not  likely  represent 
desired  targets. 

STATISTICAL  ASSESSMENT 

The  K-distribution  models  the  statistics  of  the  backscatter  from  a  coherent  imaging  process.  It 
has  successfully  modelled  radar  backscatter  from  land  and  the  sea  surface  and  sonar  images  of  a 
range  of  seabed  sediment  types.  The  generalized  K-distribution  provides  a  good  fit  to  all  the 
sediment  types,  including  rocky,  sea  grass,  muddy,  and  sandy.  The  K-distribution  is  expressed 
as 


P(z)  =  J”p(z|x)Pc(x)dx, 


where 


P(z|x)=-exp(  Vx) 


(2) 


(3) 


and 


Pc(x)=^^exp(-bx). 

To  determine  if  an  area  contains,  as  a  subset,  an  object  of  interest,  appropriate  hypotheses  are 
constructed.  The  null  hypothesis  is  that  there  is  no  object  present.  If  there  is  no  object  present, 
then  the  mean  of  the  surrounding  area  will  be  similar  to  the  mean  of  the  possible  target  area.  The 
alternative  hypothesis  is  that  there  is  an  object  present.  If  there  is  an  object,  then  the  two  means 
will  be  different.  The  hypotheses  are  not  based  on  an  exact  signature  and  therefore  adapt  to 
different  conditions. 
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The  calculation  of  the  probability  of  detection  in  K-distributed  clutter  and  noise  is  quite  difficult. 
The  probability  of  detection  equation  is  given  by 


^  CO 

PAY\x,N)  =  PMs,N)dfi, 
Jy 


where 


PMs,N)  =  (-) 


(«-!)/ 
li\  /2 


W=0 


r(iv  +  i) 


N-l+2i 


(5) 


(6) 


An  approximate  method  that  uses  linear  interpolation  in  a  table  has  been  developed.'"*  However, 
the  necessary  tables  of  coefficients  are  only  available  for  some  models.  Alternatively,  the 
probability  of  detection  can  be  calculated  by  numerical  integration  over  the  K-distribution  in 
which  the  K-distribution  is  separated  into  its  components  and  only  the  integration  over  the  local 
clutter  power  is  carried  out  numerically.  This  represents  a  faster  and  more  accurate  result,  but 
is  still  numerically  limited. 


The  rest  of  this  section  defines  the  method  to  produce  a  test  statistic  which  uses  the  matched 
filter  selectivity  and  the  normally  distributed  data  that  it  produced.  The  following  notation  uses 
B  and  T  for  background  and  target  as  applied  to  the  matched  filter  output,  not  the  original  sonar 
imagery.  Given  the  normal  distributions  shown  in  the  previous  section  (Figure  5  and  6)  with 
means  [ig  and  li-r  and  a  common  covariance,  E,  the  hypotheses  are 


^0  ■ Mb  —  Pt 


(V) 


^1-  Mb  a  fif. 


(8) 


The  means  are  calculated  by  convolving  the  image  with  masks  that  isolate  the  target  and  use  the 
surrounding  background.  The  masks  are  shown  pictorially  in  Figure  7.  The  center  region  is  the 
target  area.  There  is  a  guard  band  around  the  target  to  help  avoid  contamination  (the  white  area). 
The  guard  band  and  the  background  masks  are  annuli  around  the  target  area.  So,  the  three  masks 
form  concentric  and  non-intersecting  areas.  The  target  area  has  a  relatively  small  contribution  to 
the  covariance  calculation  and  can  be  omitted  if  the  targets  sought  are  only  a  few  pixels.  These 
masks  allow  us  to  calculate  sample  means,  (/Zg, /Tt’),  and  a  sample  covariance,  S. 
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Figure  7.  Mask  for  mean  calculation 

Broadband  sonar  can  be  divided  into  multiple  frequeney  bands.  Let  p  be  the  total  number  of 
bands.  Then,  eaeh  pixel  is  a  p-dimensional  sample  that  can  be  represented  by  the  eolumn  vector 

Xi  =  [xi,  X2, ... ,  Xp] .  ,  where  the  superseript  T  (non-italieized)  is  for  transpose. 

The  test  statistie  is  a  scalar-valued  funetion  of  the  possibly  multidimensional  quantities:  average 
target  eorrelation  value,  the  average  background  correlation  value,  and  the  common  covariance 
attributable  to  both  the  baekground  and  the  target. 

To  reliably  test  the  hypothesis  we  require  some  knowledge  of  the  distributions,  in  terms  of  the 
means  and  eovarianee.  In  most  practieal  applieations,  the  exaet  mean  and  eovariance  of  the 
baekground  and  target  are  unknown.  The  assumed  distributions  and  the  samples  ean  be  used  to 
elicit  estimators  for  these  parameters.  Maximum  likelihood  estimators  provide  good  estimates. 
Although  there  are  several  different  methods  for  estimating  unknown  statistieal  parameters  under 
fairly  general  eonditions  maximum  likelihood  estimators  satisfy  some  desirable  properties.  The 
maximum  likelihood  estimator  has  a  normal  distribution  with  a  mean  equal  to  the  true  parameter 
with  a  minimum  varianee. 

The  correlation  values  are  computed  over  the  disjoint  sets  (baekground  and  target),  defined  by 
the  masks  in  Figure  7.  The  total  sample,  X,  is  made  up  of  baekground  pixels,  given  by  5  = 

[b^,  b2, ... ,  bm^],  and  the  possible  target  pixels,  T  =  [to,  ti,  ••• ,  tNr]-  The  number  of  pixels  used 
in  the  detection  calculation  is  the  sum  of  the  two  samples,  denoted  hy  N  =  Ng  +  Nj.  The 

sample  vector  mean  of  the  baekground  pixels  is  given  by  ^  b>-B-  Similarly,  the  sample 

target  mean  is  ^  /]\j^  = 


The  null  hypothesis  defines  a  subset  of  the  parameter  space.  Let  to  be  the  set  of  points  in  the 
parameter  space  that  eorrespond  to  Hq  and  let  Q  be  the  set  of  all  points  in  the  parameter  spaee. 
For  our  example,  o)  =  {(/Ug,  —  Mr,  <  Mb,  Mr  <  ^^d  fi  =  {(/Tg,  —  cxi  < 

Mb,  Mr  <  By  utilizing  the  likelihood  function  we  ean  judge  if  a  particular  sample  of  the 
background  and  target  does  not  support  Hq.  Let  the  sample  be  given  and  let  T(d))  be  the 
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likelihood  function  maximized  with  respect  to  parameters  in  co,  and  let  L(n)  be  the  likelihood 
function  maximized  with  respect  to  parameters  in  Q.  The  likelihood  ratio  is 


L(fi) 


-  PrYA-^  -  PrY_ 


(9) 


where 


Nq  Ny 

A  =  -  ]lj{bi  -  Ji^  +  -  ]iy . 

i=l  i=l 


(10) 


The  inequality  <  k  is  equivalent  to 


F(X)  -  ^  -  iij  >  (k-%  -  l)  (iV  -  2), 


(11) 


withS  =  ^/(^_2). 

The  criterion  states  that  the  critical  region  for  a  test  of  Hq  against  H^is  defined  by  the  set  of 
points  in  the  sample  space  for  which  T(m)  /KJk)  <  k,  where  k  is  selected  so  that  it  has  the 
desired  level  of  significance.  The  value  F{X)  is  our  detection  statistic. 

DETECTION  AND  FA  PROBABILITIES 


Recalling  that  the  distributions  of  the  background  and  target  pixels  were  normally  distributed, 
AiCjUg,  S)  and  iVC/r^,  S),  the  sample  means  of  the  background,  /fg,  and  target,  ii-j-,  are  distributed 

according  to  N  VjVg  ^  ^/Nj  ^)’  respectively.  Additionally, 


S  = 


NbNt 


V. 


•/ 


N 


(Mb  Ft) 


(12) 


is  normally  distributed  with  zero  mean  and  covariance  E. 
The  term  S  was  derived  from 


Ng  N'f 

A  =  -  ]lj{bi  -  ]iy  +  -  ]iy . 

i=l  i=l 


(13) 


Because  A  can  be  written  as  St  where  each  Zt  is  independent  with  distribution  iV(0,  S)  so 
can  (iV  —  2)5.  Under  these  conditions,  it  can  be  shown  that  F(X)  is  distributed  as  a  noncentral  F 
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with  p  and  N-p-\  degrees  of  freedom  and  5  as  the  noneentrality  parameter.'^  A  graph  of  the  F- 
distribution  is  shown  in  Figure  8. 


0  1  2  3  4  5 


X 


Figure  8.  F-Distribution 


Sinee  the  noneentrality  parameter  is  nonzero  under  ffi,  computation  of  the  probability  of 
detection  requires  knowledge  of  the  target  and  background  distributions.  The  probability  of 
detection  is  equal  to  the  probability  of  rejecting  Hq,  when  is  true.  This  is  calculated  as 


=  />{ 


N  —  p 


N„N. 


'  Up(iV-2)J  N 

rA  -  p  -  1] 


> 


Lp(A-2) 


=  1 


(Mb  ^(Pb  Pt) 

(/c“^/iv  -  1)(A  -  2)|Ai  truej 

^1- 


(14) 


The  number  k  is  the  threshold  for  the  detection  metric  and  p  denotes  a  noncentral  F- 
distribution  with  5  as  the  noneentrality  parameter  and  p  and  N  —  p  —  1  degrees  of  freedom. 


The  probability  of  a  false  alarm  Pfg  is  equal  to  the  probability  of  rejecting  Aowhen  Hq  is  true. 
The  noneentrality  parameter  becomes  zero  under  Hq.  The  value  Pfg  is  given  by 


P(^  =  P 


NbNt 


N 


(Pb  -  Pt)^F  Hpb  -  Pt)  >  (P  -  l)(A  -  2)|Ao  true 


~  1  Po,p,N-p-l 


N  —  p 


)( 


k-2/"  - 1) 


(15) 


where  Fo,p,N-p-i  denotes  a  central  F-distribution  with  p  and  N  —  p  —  1  degrees  of  freedom. 

THRESHOLD  SELECTION 


Ideally  the  threshold  is  selected  based  on  the  F-distribution  using  the  number  of  pixels  in  the 
samples,  N,  and  the  number  of  sonar  bands,  p.  However,  the  probability  of  detection  requires 
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specific  knowledge  of  the  true  means  of  the  correlation  values  and  there  may  be  shifts  in  the 
correlation  values  that  also  have  an  effect  on  the  probability  of  false  alarm.  Additionally,  the 
lower  values  of  the  F-distribution  are  affected  by  loss  of  significant  figures  that  stems  from  the 
subtractions  in  the  F(X)  calculation.  Both  of  these  shortcomings  are  overcome  by  an 
approximation  to  the  significant  portion  of  the  F-distribution  as  determined  by  the  observed  final 
values. 

The  F-distribution,  after  its  peak  value,  can  be  approximated  with  the  exponential  distribution. 
The  peak  can  easily  be  identified  on  a  local  basis,  image  by  image  or  other  suitable  partition. 

The  probability  density  function  corresponding  to  the  exponential  distribution  is  g{_x)  = 


(V/r)exp(  7^). 


Whereas  the  F-distribution  had  p  and  N  as  parameters,  the  exponential  distribution  has  //,  the 
distribution  mean,  as  a  single  parameter.  The  mean  is  estimated  from  the  final  values  above  the 
F-distribution  peak  value.  Further,  this  estimation  enables  the  distribution  to  shift  independently 
from  image  to  image  based  on  the  observed  data.  The  derivation  of  the  F-distribution  assumed 
that  the  correlation  values  followed  the  Gaussian  distribution.  Using  the  modeled  exponential 
distribution,  we  are  better  able  to  handle  deviations. 

Figure  9  is  a  histogram  of  detection  values  fit  with  an  exponential  density  function.  Flistograms 
are  computed  and  fit  for  each  image.  The  fit  is  rather  poor  for  the  first  few  bins,  but  is  fairly 
accurate  in  the  last  few.  In  cases  where  only  a  few  false  alarms  are  desired,  it  is  the  last  few  that 
are  responsible  for  agreement  between  the  desired  number  of  false  alarms  and  the  number 
produced  by  the  threshold  selection  method. 


Statistic  Value 


Figure  9.  Detection  values  fit  with  exponential  distribution 


The  mean  is  estimated  using  the  detection  values  as  discrete  samples  of  the  continuous 
exponential  distribution.  The  desired  number  of  false  alarms  per  image  is  calculated  using  the 
distribution  function,  G,  and  the  number  of  samples,  N.  The  distribution  function  is  given  by 


0,  —CXI  <  X  <  0 

1  —  exp(— x//t),  0  <  X  <  CXI. 


(16) 
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If  K  is  the  desired  number  of  elements  that  are  expeeted  to  be  greater  than  a  threshold  T,  then  the 
equation  is  [1  —  G(r)]iV  <  K.  Solving  for  T  gives: 


1  -  [l  -  exp(7/i)]  <  ^ 


(17) 


T  >  -lilniK/N). 


(18) 


Using  equality  in  the  final  equation  for  T  assures  that  the  threshold  is  calculated  to  be  the  lowest 
possible  to  achieve  the  desired  number  of  false  alarms.  A  consequence  of  selecting  the  lowest 
threshold  is  that  it  is  not  uncommon  that  K  false  alarms  are  encountered. 


RESULTS 

A  range  of  desired  false  alarm  values  were  selected  for  testing.  The  test  set  consisted  of  35 1  dual 
Ifequency  synthetic  aperture  sonar  images  made  up  of  both  uniform  and  uneven  backgrounds. 
The  wide  variety  of  backgrounds  was  necessary  to  show  adaptability  of  the  threshold  selection 
method.  The  threshold  is  dynamically  selected  for  each  image  based  on  the  overall  selected 
number  of  desired  false  alarms.  Figure  10  is  a  histogram  of  the  threshold  values.  The  thresholds 
are  concentrated  in  the  teens,  but  vary  over  a  significant  range  as  the  method  adapts  to  differing 
backgrounds. 


10  20  30  40  50  60  70  80 

Threshold  Value 

Figure  10.  Threshold  dynamics 


The  selected  values  and  the  actual  false  alarm  rates  (FAR)  are  given  in  Table  1.  The  FAR  shown 
is  average  false  alarms  per  image.  There  is  a  noteworthy  discrepancy  between  the  K- value  and 
the  actual  number  of  false  alarms.  However,  the  relationship  is  nearly  linear  across  the  tested 
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values.  Thus,  it  is  possible  to  predict  performance  and  choose  a  k-value  based  on  the  sought 
after  number  of  false  alarms,  which  is  the  desired  property. 

Table  1.  Desired  vs.  Actual  False  Alarm  Rates 


K  values 

5.0 

2.0 

1.0 

0.5 

0.25 

0.1 

0.01 

Actual  FAR 

32.99 

10.45 

4.98 

2.81 

1.89 

1.36 

0.88 

We  also  compared  this  detection  method  with  the  CCA  method.^  This  FX  based  method 
calculates  a  dynamic  threshold  and  so  a  traditional  Receiver  Operating  Characteristic  (ROC) 
curve  is  not  directly  applicable.  While  it  is  possible  to  produce  a  ROC-like  curve  for  the  F(X) 
method  using  various  k  values,  the  actual  threshold  applied  to  the  data  is  not  constant  for  any 
particular  value.  The  graph  of  the  two  methods  is  shown  in  Figure  1 1 .  The  vertical  axis 
represents  the  percentage  of  mine-ltke  targets  found  in  the  data.  The  solid  curve  is  the  ROC  for 
the  CCA  method.  The  circles  are  the  performance  of  the  F(X)  method  for  distinct  k  values  (as 
indicated  by  the  adjacent  text). 


Figure  11:  FXvs.  CCA 


Detection  is  typically  followed  by  feature  extraction  and  classification  in  automatic  target 
recognition.  Classification  methods  can  be  computationally  intensive  and  so  the  function  of 
the  detection  stage  is  to  eliminate  most  of  the  data,  so  that  the  more  intensive  algorithmic 
elements  are  not  overburdened.  The  competing  requirement  is  that  targets  of  interest  are  not 
filtered  out.  The  graph  in  Figure  1 1  shows  that  for  a  fixed  number  of  false  alarms,  the  FX 
method  passes  more  detections  than  the  CCA  method,  particularly  in  the  one-to-three  false 
alarms  per  image  values. 

CONCLUSION 

This  work  presents  a  constant  false  alarm  rate  (CFAR)  detection  and  threshold  selection 
algorithm  for  sonar  imagery.  It  uses  a  multi- frequency  matched  filter  element  for  selectivity  and 
is  based  on  the  statistical  properties  of  that  output.  These  properties  allow  for  the  use  of  an 
automated  threshold  selection  procedure.  Rather  than  trying  to  experimentally  adjust  a  threshold 
to  achieve  a  desired  output,  a  user  can  select  the  desired  output  and  allow  the  procedure  to 
dynamically  set  the  threshold.  The  desired  output  is  determined  by  a  clear  and  obvious  process. 
This  entire  algorithm  is  designed  to  avoid  the  numerical  estimates  of  a  K-distribution  based 
statistical  test,  while  exploiting  geometric  features  and  frequency  response  differences  between 
the  local  environment  and  man-made  targets. 
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